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ABSTRACT 

Wavelet-related techniques have proven useful in the processing and analysis of 
one and two dimensional data sets (spectra in the former case, images in the latter). In 
this work we apply adaptive filters, introduced in a previous work (Sanz et al. 2001), to 
optimize the detection and extraction of point sources from a one-dimensional array of 
time-ordered data such as the one that will be produced by the future 30 GHz LFI28 
channel of the ESA Planck mission. At a 4tr detection level 224 sources over a flux 
of 0.88 Jy are detected with a mean relative error (in absolute value) of 21% and a 
systematic bias of —7.7%. The position of the sources in the sky is determined with 
errors inferior to the size of the pixel. The catalogue of detected sources is complete 
at fluxes > 4.3 Jy. The number of spurious detections is less than a 10% of the 
true detections. We compared the results with the ones obtained by filtering with a 
Gaussian filter and a Mexican Hat Wavelet of width equal to the scale of the sources. 
The adaptive filter outperforms the other filters in all considered cases. We conclude 
that optimal adaptive filters are well suited to detect and extract sources with a 
given profile embedded in a background of known statistical properties. In the Planck 
case, they could be useful to obtain a real-time preliminary catalogue of extragalactic 
sources, which would have a great scientific interest, e. g. for follow-up observations. 
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1 INTRODUCTION 

One of the most thrilling challenges in the study of the 
Cosmic Microwave Background (CMB) is to deal with the 
problem of separating the cosmological signal from the dif- 
ferent foregrounds and noises that appear in CMB experi- 
ments. This problem will be specially relevant in future high- 
resolution experiments such as MAP (Bennett et al. 1996) 
and Planck (Mandolesi et al. 1998, Puget et al. 1998). From 
the point of view of determining constraints on fundamen- 
tal cosmological parameters, the different foregrounds (syn- 
chrotron emission, galactic dust, free- free radiation, thermal 
and kinetic Sunyaev-Zel'dovich (SZ) emission from galaxy 
clusters and extragalactic point sources) are considered as 
contaminants and therefore must be removed together with 
the noise in order to extract the cosmological signal. Be- 
sides, knowledge about each of these 'contaminants' has a 
great scientific relevance on itself. Therefore, it is of excep- 
tional importance to be provided with good techniques of 
denoising and foreground separation. 
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There are several methods already available to perform 
the separation of the different components in CMB observa- 
tions, such as the ones based on Wiener filter (Tegmark and 
Efstathiou 1996, Bouchet and Gispert 1999) and on Max- 
imum Entropy Methods (MEM, Hobson et al. 1998, 1999; 
Stolyarov et al. 2001). All these methods take advantage of 
the different statistical properties of the CMB and the fore- 
grounds (e.g. different angular power spectra) as well as the 
knowledge of their distinctive frequency dependence. When 
neither the spatial distribution of the foregrounds nor its 
frequency dependence are well known the performance of 
these separation methods dramatically drops. Such is the 
case of point sources, whose spatial distribution and abun- 
dance remains uncertain and whose frequency dependence 
is not well known. Besides, they can show temporal and 
even spectral variability. Extragalactic point sources should 
be removed from the maps before any analysis should be 
performed. Maximum Entropy Methods can produce a cat- 
alogue of point sources as a residual (noise) from the sep- 
aration process. However, only the faintest sources are re- 
covered, and the brightest ones can still be observed in the 
residuals. The Independent Component Analysis technique 



© 0000 RAS 



2 Herranz et al. 



(Baccigalupi et al 2000, Maino et al 2001) has also been ap- 
plied to this problem with promising results: it has the ad- 
vantage that, unlike the previous methods, it does not need 
any prior knowledge of the components to be separated, but 
its weakest point is actually the separation of point sources. 

Filtering techniques have been successful in both de- 
noising and extracting the brightest point sources from CMB 
maps. Tegmark and Oliveira-Costa 1998 presented a filtering 
scheme that minimizes the variance of the map. However, 
their filter does not produce a maximum gain (considered 
as the ratio between the cr-level of the source after filtering 
and the cr-level of the source before filtering) at the scale of 
the source. All the point sources in a CMB map will have 
the same profile and size (the convolution of a 5-Dirac source 
with the antenna beam) and the filtering process should take 
advantage of this fact using a filter optimal for that particu- 
lar profile and size (scale). Wavelet techniques are specially 
suitable to deal with scales and spatial location at the same 
time. The application of the Mexican Hat Wavelet (MHW) 
to realistic simulations was presented in Cayon et al. (2000) 
and extended in Vielva et al. (2000a). The principal advan- 
tage of this method is that no specific assumptions about the 
statistical properties of the point source population or the 
underlying emission from the CMB and other foregrounds 
is required. The MHW and MEM techniques have success- 
fully been combined to extract both faint and bright point 
sources at the same time (Vielva et al. 2000b). 

It is desirable to exploit to the maximum the mentioned 
scale distinctiveness of point sources. In Sanz et al. 2001 
(S01, hereafter), a method was presented to, for a given 
source brightness profile and given the statistical proper- 
ties of the background (CMB plus foregrounds and noise), 
derive the analytic shape of the optimal filter for the partic- 
ular considered case. That kind of filter was called 'optimal 
pseudo- filter'. S01 defines as an optimal pseudo-filter one 
that a) is unbiased, b) gives a maximum at the position and 
scale of the source and c) gives the minimum variance of 
pseudo-filter coefficients (i.e. is an efficient estimator of the 
amplitude of the sources) under assumptions a) and b) . They 
considered one, two and three-dimensional cases and showed 
that the MHW is optimal (in the mentioned sense) or nearly 
optimal for a two-dimensional Gaussian profile and reason- 
able foreground conditions. After carefully thinking about 
the terminology used in that previous work, we have de- 
cided to substitute the rather confusing 'pseudo-filter' term 
by 'adaptive filter'. This is so because a filter that satisfies 
conditions a) to c) adapts itself to the characteristics of the 
signal and the noise. In the following we will use 'adaptive 
filter' in the same sense as 'pseudo-filter' in S01. 

Although two-dimensional maps are the most useful and 
extended form to show CMB data, they are only available 
after an exhaustive process of analysis and reduction of the 
raw data from CMB experiments. These experiments scan 
different patches of the sky in a sequence producing a uni- 
dimensional set of time ordered data (TOD). TODs suffer 
from lower signal-to-noise ratios than final two-dimensional 
maps, but on the other hand are less likely to show artifacts 
coming from the data reduction, such as pixel-to-pixel noise 
correlations. Moreover, TODs can be analyzed in (almost) 
real time during the observations in order to produce early 
(preliminary) catalogs of sources. 

In this paper we apply the techniques developed in S01 



to realistic simulations of the TOD coming from one of the 
30GHz Planck LFI's channels. In section ^| we summarize 
some of the conclusions of S01 and present the semi-analytic 
adaptive filter that should be used in a realistic case. Sec- 
tion 3 describes the simulations used in this work. In sec- 
tion 3 we describe the analysis of the simulated data. In 
section [| we describe the performance of the optimal adap- 
tive filter, comparing it with other filtering schemes such as 
Gaussian filter and MHW. Finally, in section ^| we discuss 
our conclusions and give an outline of future work in this 
field. 



2 ONE-DIMENSIONAL ADAPTIVE FILTER 

The general formalism of adaptive filters in an n-dimensional 
image was presented in S01. TODs can be considered as 
a particular case of a one-dimensional image (a spectrum 
is another interesting case). The image data values can be 
expressed as 



y{t) = s(t)+n(t), 



(1) 



where t is the time, s(t) represents a symmetric source and 
n(t) is a homogeneous and isotropic background with mean 
value < n(t) > = and characterized by the power spectrum 
P(q) (q is the absolute value of the 'wave vector' associated 
to t). If A = s(0) is the amplitude of the source we can 
introduce the profile r(t) as s(t) — Ar(t). Let us introduce 
a centrally symmetric adaptive filter fe, 7?) 



(2) 



where b defines a translation whereas R defines a scaling. 
The adaptive filtered field is defined as: 



w(R,b) 



dty(t)ty(t;b,R) 



dqe-'VybMRq) (3) 



In the last equivalence we have expressed the convolution as 
a product in Fourier space. 

Following S01, it is possible to derive the analytic form 
of the optimal adaptive filter for a given source profile, once 
we have defined what we mean by 'optimal'. Let us define 
an optimal adaptive filter as one that satisfies the following 
conditions: 

(i) There exists a scale R such that at the point source 
position (6 = 0) < w(R, 0) > has a maximum at that scale. 

(ii) < w(R o ,0) >= s(0), i.e. w(R o ,0) is an unbiased esti- 
mator of the amplitude of the source. 

(iii) The variance of w(R, b) has a minimum at the scale 
R , i.e. we have an efficient estimator. 

Given these three conditions, the solution (adaptive fil- 
ter) is found to be: 



i>(q) = i>(R q) = [b + c-(a + b) dh> T 



2P(q)A l 



ding 



(4) 



a = / dq — , b = 



dq- 



dr 



t dr 
l Pdlnq' 



A = ac—b 



'P l dlnq' 

The limits of the integrals go from to oo. 



(5) 
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In SOI analytic expressions for Gaussian sources and 
backgrounds of the type P{q) oc g -7 were derived. In a 
more realistic case, the background can not be modelled in 
such a simple way, and integrals in (g) should be numerically 
estimated. When dealing with real data (or realistic simu- 
lations such as those used in this work) we must perform 
the following steps: first, determine the power spectrum of 
the background directly from the image. Second, evaluate 
integrals (5|). Third, build the adaptive filter Q and make 
the convolution ([J). Finally, we can proceed to detect the 
sources, for example looking for peaks above a certain a- 
level in the coefficient (filtered) image. 

When determining the power spectrum of an image we 
obtain the power spectrum of both the background and the 
sources together. In the following we consider that the con- 
tribution of the point sources to the total power spectrum is 
negligible. This is a reasonable assumption in a realistic case, 
specially at medium and high wavelengths where the emis- 
sion of IR and radio sources is weak. Another problem re- 
lated to power spectrum determination is the variance of the 
power spectrum estimator. The variance of one-dimensional 
power spectrum estimators tends to be larger than that of 
two-dimensional cases because the number of sample points 
is usually smaller. A possible solution to this problem is 
to estimate low resolution power spectra. However, a large 
amount of information is lost at scales that are crucial for 
the determination of the adaptive filters. For this work we 
chose to average the estimated power spectra of adjacent 
rings of the TOD (corresponding to regions of the sky sep- 
arated by only a few arcminutes and therefore possessing 
very similar underlying power spectra). 

A typical profile of the optimal adaptive filter (in 
Fourier space) for a section of the simulated TOD used in 
this paper is shown in figure |l[ The profile (solid line) is 
irregular due to the roughness of the estimated power spec- 
trum. These irregularities reflect the particularities of the 
data and define the scales where the adaptive filter is more 
or less efficient. For comparison, a Mexican Hat Wavelet 
(dashed line) and a Gaussian (dotted line), both of them 
with a width equal to the width of the source, are also shown. 



3 DATA 

For this work we chose the 30 GHz LFI28 channel of Planck 
because of the relatively small size of its resulting TOD. 
Higher frequencies, in spite of being expected to show more 
contribution from point sources due to their higher resolu- 
tion and the contribution of IR sources at v > 300 GHz, lead 
to huge TOD sets and require many hours of computation 
to be simulated and analysed on an average computer and 
will be analyzed in a future work. 

Our data simulates a 6 months run of the 30 GHz LFI28 
channel of Planck, covering the whole sky except for two 
circles of 1°82 around the ecliptic poles. The sky simulation 
consist of the addition (in flux) of a CMB simulation and 
three templates, each for the Galactic synchrotron radiation, 
Galactic dust and thermal SZ. These templates have been 
generated using proper models that are freely distributed 




Figure 1. Several filters in Fourier space. The optimal adaptive 
filter particularised for an individual ring of our simulations is 
represented by a solid line. The dashed line shows a Mexican Hat 
Wavelet of width equal to the width of the (Gaussian) source. The 
dotted line represents a Gaussian of width equal to the width of 
the source. All filters are normalized to give the true amplitude 
of the source after convolution. 



among the Planck collaboration^. This sky is then combined 
with a map of point sources, generated from a catalog per- 
formed following the model of Toffolatti et al. 1998 (that is 
not public for the Planck collaboration) and then 'observed' 
by the Planck Pipeline Simulator for one of the LFI 30 GHz 
radiometers; resulting in a TOD for the observed sky (CMB, 
foregrounds and point sources) plus instrumental noise. The 
data are ordered in circular rings centered on points situated 
on the Ecliptic. The angle between the pointing axis and the 
rotation axis for the LFI28 instrument is 88° 18. The max- 
imum separation between two consecutive rings is 2.5' (at 
the intersection with the Ecliptic plane). Each ring results 
from the average of 60 revolutions of the detector around 
the rotation axis, corresponding to one hour of integration 
time, and contains 1950 measures of antenna temperature. 
There are a total of 4383 of such rings, leading to 8546850 
measures of temperature. The antenna has a FWHM of 33' 
and its response slightly differs from a circular Gaussian one. 

The simulation contains CMB emission, different ex- 
tended foregrounds (Galactic synchrotron, dust and free- 
free, thermal and kinetic SZ emission from clusters, etc), 
point sources and instrumental noise. Both white noise and 
1// noise are present. The knee frequency /„ is set to be 
< 20 mHz (less than the frequency of rotation) . In the lower 
panel of figure |^ we show a segment of one of the rings of 
the simulation. There is a bright point source near pixel 400 
(in fact, it is the brightest source in the simulation). Apart 
from this extraordinarily bright source, the features that 
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dominate the image are Galactic emission (the large peaks 
around pixels 150 and 940) and noise. 



4 DATA ANALYSIS 

The complete set of simulated TOD was filtered using the 
optimal adaptive filter described in section |S| Each individ- 
ual ring was filtered separately. The power spectrum that 
appears in equations (Q) and (g) was obtained by averaging 
the estimated power spectra of twenty-one consecutive rings 
(the ring that is being filtered, the ten previous rings and 
the ten subsequent rings). 

In order to detect the sources from the filtered image 
one can set a certain threshold over the dispersion of the fil- 
tered ring and look for connected regions (peaks) above that 
threshold. That would be the most direct detection method 
if the different rings were independent of their neighbours. 
This is not the case, since adjacent rings scan very close re- 
gions and each source is expected to appear in more than 
one ring. The use of information coming from neighbouring 
rings allows to increase the effective signal to noise ratio of 
the detections and to discard spurious 'sources' due to noise 
fluctuations. The most straightforward way to detect sources 
is then to perform a kind of two-dimensional thresholding, 
looking for connected pixels at equal latitude that appear 
in several adjacent rings. In a first approximation, the po- 
sition of the source will be the position of the maximum 
of that region of connected pixels. We will show that this 
approximation is good enough for the purpose of locating 
the sources with an error comparable to 1/3 of the antenna 
FWHM. 

Two different regimes of 'noise' have to be removed in 
order to optimize the detection of the sources. The large 
scale features due to Galaxy foregrounds as well as CMB 
fluctuations are strongly correlated between a given ring of 
data and its neighbours. On the other hand, the small scale 
noise, dominated by white instrumental noise, is expected 
to be nearly independent from one ring to another. This 
suggests a further step in the idea of combining information 
from nearby rings to increase the signal to noise ratio of 
the sources. By averaging each ring data (before filtering) 
we can construct a 'synthetic TOD' in which the large scale 
fluctuations are almost the same than in the original TOD 
but the noise at the scale of the pixel has been greatly di- 
minished. The 1// will still be present in the averaged TOD. 
The point sources of the original TOD are replaced in the 
averaged TOD by Gaussians of amplitude 




i=i 



where A is the true amplitude of the source, N a is the num- 
ber of rings that are averaged, <T(, is the beam width and 
the distance | x — £i j 2 is the geodesic (spherical) angular 
distance between the pixel corresponding to the position of 
the source and the pixel that is being added to calculate the 
average. This is true when the beam is a perfect Gaussian 
because in that case the average is a weighted sum of Gaus- 
sians of equal width, that is, a new Gaussian of the same 
width and amplitude given by eq. (^|). Therefore, we can 
filter the rather denoised, 'synthetic TOD', instead of the 



original TOD and correct the amplitude of the sources that 
we will detect using eq. (^|) in order to recover an estimate 
A of the true amplitude A. 

Although the Planck Pipeline Simulator uses a realistic 
beam, we assumed for this work that the beam is a perfect 
Gaussian. In section |E| it will be shown that this approxi- 
mation works reasonably well. However, future work should 
take into account real beams; the 30 GHz detector is ex- 
pected to be slightly elliptic. This asymmetry will introduce 
complicate effects in the map making and analysis. In the 
case of TOD the effect of ellipticity, when projected over 
the scan trajectory, is a change in the 'effective width' of 
the beam. Therefore, the scale-adaptive filter should be cal- 
culated for such 'effective width'. More complicated asym- 
metries will require more careful treatment. Future work will 
deal with this issue. 



5 RESULTS 

In order to determine the goodness of the averaging of close 
rings we performed two different filterings of the TOD. In 
the first case, the raw data of the TOD were filtered ring 
by ring as described in the section [|. In the second case, a 
'synthetic TOD' in which each individual ring was the av- 
erage of the original ring (at that position) with the twelve 
neighbours was constructed. The number of rings averaged 
is such that at the maximum ring spatial separation region 
on the sky (the ecliptic Equator) the separation between 
the two most separated rings is approximately equal to the 
FWHM of the beam. After filtering, the detection and ex- 
traction was performed looking for sets of 5 or more con- 
nected pixels (in the 2-dimensional sense). A lesser number 
of connected pixels required would lead to a much higher 
number of spurious detections. To check if the detections 
correspond to real sources we compare with the catalogue of 
the 1000 brightest sources present in the simulations. This 
catalogue is complete and its flux limit is clearly lower than 
the expected flux limit of the detected sources. 

In the lower panel of figure [] the number of detections 
(defined as the number of peaks encountered above a certain 
threshold that correspond to a real source in the reference 
catalog) above several a thresholds is shown for the case 
where the data have been averaged (filled circles and solid 
line) and the case where they have not (open circles and 
dashed line). The remaining quantities of interest (such as 
estimation of the amplitudes, fluxes, etc.) concerning the 
detection/extraction of sources over the averaged and then 
filtered TOD are shown in table [l] and will be discussed later. 
As expected, the number of sources detected when analysing 
the synthetic TOD is 4 to 5 times higher to the number of 
sources detected over the original, filtered TOD for every a 
level. 

Apart from having the greatest number of detections, it 
is also of great importance to reduce as much as possible the 
number of spurious detections. In the lower panel of figure ^ 
(labeled as 'simple detection') the ratio between spurious 
and 'true' detections is represented for the case where the 
data have been averaged (filled circles and solid line) and 
the case where they have not (open circles and dashed line). 
The ratio is lower for the case in which the data have not 
been averaged. It is evident that a kind of compromise has 
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Figure 2. A 1024-pixel section of one of the rings before and after filtering. The lowest panel shows the section before any filtering. The 
second panel shows the section after filtering with a Gaussian of FWHM 33' (equal to the one of the source). The third panel shows 
the section after filtering with a Mexican Hat Wavelet of width equal to the width of the source. The top panel shows the section after 
filtering with the optimal adaptive filter. 



to be reached between gain and reliability. If we arbitrarily 
set a maximum proportion of spurious sources versus true 
ones, say a 10% (represented in fig. ^| with an horizontal 
dashed line), we can determine the minimum a level that 
satisfies this condition. In this example, for the case of non- 
averaged rings, we can reach the 3a and find 80 sources (to a 
minimum flux of 4.33 Jy) with a 7.5% of spurious detections. 
For the case of averaged rings, we must go to the 4a level, 
where we find 224 sources (to a minimum flux of 0.885 Jy) 
with a 9.8% of spurious detections. We conclude that the 
averaging of neighbouring rings is a valid strategy to reduce 



pixel-scale noise. In the following, all the results will refer to 
filtering of averaged rings. 

The number of detections and spurious sources found 
with the optimal adaptive filter applied to a synthetic TOD 
is shown in table |l| The determination of the position of the 
source, the mean relative absolute error in the determina- 
tion of the amplitude (defined as m.r.a.e. =\ Aq — A e \ /Aq, 
where A e is the estimated amplitude and Ao is the real am- 
plitude), the mean bias (defined in the same way as the 
m.r.a.e, but without the absolute value), the minimum flux 
reached and the completeness flux are also included in ta- 
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Table 1. Detections at 30GHz with the Optimal Adaptive filter, compared to a Mexican Hat 
Wavelet (MHW) and a Gaussian window with FWHM of 33'. Col. (1): a detection level. Col. 
(2): Number of sources found. Col. (3): Number of spurious detections. Col. (4): Mean position 
offset. Col. (5): Mean relative absolute error of the amplitude (defined as r.a.e. =| AQ—A e \ /Aq, 
where A e is the estimated amplitude). Col. (6): Mean bias in the amplitude. Col. (7): Minimum 
reached flux. Col. (8): Flux over which the catalog of detections is complete. 
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ble |l| In each case the mean error in the position of the 
sources is comparable with the size of the 'pixel' of the TOD, 
11'. The determination of the amplitude using eq. (^|) has rel- 
ative errors ranging from 24.32% at 3<r threshold to 19.82% 
at 5.0a. The error decreases as the detection threshold in- 
creases. This indicates that the estimation of the amplitude 
of weak sources is less accurate than the estimation of the 
amplitude of bright ones. Also, the determination of the am- 
plitude is biased to higher values at low a levels and to lower 
values (negative bias) at high a levels. The positive bias for 
weak sources arises due to the peak finding algorithm: it 
finds preferently the maxima in pixels where the noise con- 
tribution is positive. 

Let us consider for a while the possible causes for the 
mentioned bias and how to deal with it. Scale-adaptive fil- 
ter is designed to be an unbiased estimator of the amplitude 
of the sources. Thus, the method is unbiased. However, in 
practise it is found a small bias. The origin of this bias can 
be found in two different kind of effects. On the one hand, 
the bias introduced by the detection method: this bias was 
described in the last paragraph and is related with the well- 
known 'detection bias'. On the other hand, bias may arise 
due to the non-ideality of the data. In first place, the profiles 
considered in the design of the filters are continuous whereas 
real data is pixelised. Therefore, the correlation between the 



(continuous) filter and the (pixelised) source profile is not 
perfect, this resulting in a non-ideal performance of the fil- 
ter. In second place, the limits in the integrals (^) are from 
to oo, whereas in real data the frequencies are limited by 
the sizes of the data set and the pixel. Therefore, integrals 
(^) can only be approximately calculated. This leads to an 
inaccuracy in the determination of the shape of the filter 
and, more important for the bias, the normalization (the 
A in the mentioned integrals). Another source of bias ap- 
pears when the assumption about the profile of the sources 
is wrong. For example, we have assumed that the detector 
beam is Gaussian, while in fact it is not; this could explain 
the bias in our results. In SOI it was found using simple 
simulations that the bias due to the non-ideality of the data 
was negative. Similarly, the negative bias that appear here is 
interpreted in the way described above. In practise, it is dif- 
ficult to distinguish from the data the different contributions 
to the bias among the ones above mentioned. A possibility to 
overcome the bias is to calibrate it using a great number of 
simulations with the same background and artificial sources 
of known amplitude. Once removed the 'systematic' bias, 
the remaining error will be statistical. This kind of study 
will be carried on in a future work. 

In the case we are considering the bias is never greater 
than 16%. At intermediate thresholds, the (positive) effects 



© 0000 RAS, MNRAS 000, 000-000 



Point source detection and extraction from simulatedPlanck TOD using optimal adaptive filters 7 



of detection bias and the non-ideality of the data (negative) 
tend to cancel and, on average, the amplitude estimates are 
unbiased. To compare with other 'classical' filters we re- 
peated the process using a Gaussian filter and a Mexican 
Hat Wavelet (MHW, hereafter), both of them with a width 
equal to the beam width of the antenna (33'). The normal- 
ization of both filters was chosen so that the coefficient at the 
position of the source is equal to the amplitude of the source 
(that is, the filtering process does not change the amplitude 
of the sources). The adaptive filter automatically satisfies 
this condition (condition 2 for an optimal adaptive filter). 
While the MHW and the optimal adaptive filters are both 
band-pass filters, the Gaussian is a low-pass filter, so the 
comparison with the Gaussian is a bit unfair: the Gaussian 
is expected to perform significantly worse than the other 
two filters. The number of detections above several thresh- 
olds for the two 'classical' filters together with the optimal 
adaptive filter are shown in table The number of detec- 
tions is similar for the MHW and the optimal adaptive filter, 
yet are slightly higher for the optimal adaptive filter. The 
lowest number of detections corresponds to the Gaussian fil- 
ter. In the lower panel of figure |^ the number of detections 
with the three different filters is compared. Detections with 
the optimal adaptive filter are shown with open circles and 
solid line. The open boxes and dashed line corresponds to 
MHW detections and the triangles and dashed line corre- 
sponds to Gaussian filter detections. The ratio between spu- 
rious and true detections for the three filters is shown in the 
top panel of figure ^. Except for the 2.5a level, the Gaussian 
filter produces the worst ratio. The optimal adaptive filter 
gives spurious to detected (e/d) ratios that quickly decline 
with increasing a thresholds. The e/d ratio for the MHW 
remains almost constant with a in the considered cases and 
clearly exceeds the ratio obtained with the optimal adaptive 
filter. The m.r.a.e. and the bias in the determination of the 
amplitude are huge in the case of the Gaussian filter. Both 
have very similar values. That means that the main source 
of error is systematic (the filter is biased). Considering the 
m.r.a.e, the MHW seems to give amplitude estimates a few 
percent better than the optimal adaptive filter. Flux limits 
are similar in the MHW and optimal adaptive filter cases. 
The Gaussian filter leads to higher inferior flux and com- 
pleteness limits. 

Figure U provides a useful insight into what is happening 
with the different filters. The Gaussian filter smoothes the 
image, removing very efficiently the small scale noise but al- 
lowing the large structures (the Galaxy and others extended 
fluctuations) to remain in the image. The naive a threshold- 
ing counts these bright, large structures as sources, leading 
to a big relative number of spurious detections. Besides, the 
sources that by chance lie on 'valleys' of the background can 
not be detected. On the other hand, sources that lie on areas 
of positive background are enhanced and can be more easily 
detected. This can explain the large and positive bias in the 
detections with the Gaussian filter. 

Both MHW and optimal adaptive filter are better pre- 
pared to deal with this problem than the Gaussian window. 
Their profiles in Fourier space drop to zero at low frequencies 
and thus they are efficient in removing large scale structures. 
Images filtered with the MHW and the optimal adaptive fil- 
ter in figure |^ are similar. A visual inspection reveals that 
MHW smoothes better the high-frequency fluctuations. The 
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Figure 3. 30 GHz detections with the optimal adaptive filter 
as a function of the <r detection threshold. The open circles and 
dashed lines represent detections over the filtered TOD. The filled 
circles and solid line represent detections over a filtered synthetic 
TOD in which each ring is the result of the average of 9 rings 
of the original TOD. The synthetic TOD has the same number 
of rings than the original one. Two cases have been represented: 
simple detection over a certain threshold (lower panel) and simple 
detection excluding a 5 deg band around the Galactic plane (top 
panel). 



optimal adaptive filter is more efficient in removing medium 
and large structures. The fluctuations around pixel 150 (cor- 
responding to one of the two observations of the Galaxy in 
the ring) are better removed with the optimal adaptive fil- 
ter than with the MHW. This is more apparent in figure [l] 
where the profiles of the different filters in Fourier space 
clearly indicate the faster drop of the adaptive filter at low 
frequencies (large scales) and its slower drop at high frequen- 
cies (small scales) . The MHW can have the same problem as 
the Gaussian filter. However, the probability of this problem 
should be smaller due to its better efficiency in removing the 
Galaxy and other large-scale structures. This effect explains 
the higher e/d ratio and the trend to positive bias. 

We conclude that the optimal adaptive filter detects 
point sources better than the MHW and the Gaussian win- 
dow. The number of detections is comparable to the number 
of detections with the MHW and clearly higher than with 
the Gaussian window. The relative number of spurious de- 
tections with the optimal adaptive filter is lower, except for 
very low detection thresholds, than with the MHW and the 
Gaussian window. Over 4a the contamination of spurious 
detections is lower than 10%. At this level (4a) the number 
of expected sources in all the sky is of a few hundreds (224 
in our simulation) above fluxes of around 0.9 Jy. 

Some tests can be performed in order to discover if the 
number of spurious sources, the most unpleasant effect of 
the filtering and detection process, can be reduced. Where 
do these spurious detections come from? One possibility is 
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Figure 4. 30 GHz ratio between the number of spurious sources 
and the number of true detections with the optimal adaptive filter 
as a function of the a detection threshold. The open circles and 
dashed lines represent detections over the filtered TOD. The filled 
circles and solid line represent detections over a filtered synthetic 
TOD in which each ring is the result of the average of 9 rings 
of the original TOD. The synthetic TOD has the same number 
of rings than the original one. Two cases have been represented: 
simple detection over a certain threshold (lower panel) and simple 
detection excluding a 5deg band around the Galactic plane (top 
panel). 

that the peak finding algorithm is detecting the Galaxy or 
other large-scale features. Such structure will appear in sev- 
eral adjacent rings, as sources do, and therefore a possibility 
of confusion exists. To test this potential source of contam- 
ination, we repeated the analysis excluding a band of 5deg 
centered on the Galactic plane (corresponding to a 4.36% 
of the sky area). The top panels of figures g and [| show 
the number of detections and the ratio of spurious/true de- 
tections, respectively, for the optimal adaptive filter in the 
cases where the rings have been averaged (as explained be- 
fore, filled circles) and where they have not (open circles). In 
the first section of table ^| are the results for the case of av- 
eraged rings (the tabulated quantities are the same than in 
table Q) are shown. The decrease in the number of detections 
corresponds to the one expected for a uniform distribution 
of sources in the sky (around 5%). This indicates that the 
density of detections around the Galactic plane is not sub- 
stantially different from the density of detections in other 
regions, less 'contaminated', of the sky. This can be seen in 
figure |(| where the 4a detections have been represented in 
Galactic coordinates (the Galactic Plane being represented 
by a dashed line) . 

The ratio between spurious and true detections remains 
almost untouched for low a. For higher a levels, the pro- 
portion of spurious detections that are due to the Galactic 
plane increases dramatically (comparing tables |l| and ^ we 
see that this proportion increases from a 7% at 2.5<r to a 
89% at 5a). This indicates that the contamination by the 



Galaxy dominates at high signal to noise ratios, whereas 
low-intensity contamination is dominated by noise fluctua- 
tions. In all cases, the number of peaks that correspond to 
the Galaxy is much smaller than the number of detected 
sources, thus implying that the optimal adaptive filter deals 
efficiently with large scale backgrounds. 

We stated before that the higher number of spurious 
detections of MHW could be due to its non-optimal per- 
formance at large scales. To further test this hypothesis 
we repeated the test for the MHW, now considering only 
the peaks found outside a 5° band centered in the Galactic 
plane, as we did with the optimal adaptive filter before. The 
results are shown in the last section of table ^[ Most of the 
spurious sources (specially at high a) lied near the Galac- 
tic plane, as we expected. And yet the remaining number 
of spurious sources is still greater than in the equivalent 
optimal adaptive filter case. The number of detections and 
the flux limits remain similar to the optimal adaptive filter 
case. The m.r.a.e. is also similar in the two cases. Finally, the 
mean bias in the determination of the amplitude is negative 
for high a thresholds, as happens with the optimal adaptive 
filter. Note that the MHW used in this work has the same 
scale as the antenna. In fact, that scale is not the optimal 
for detection (Vielva et al. 2000a). The optimal scale of the 
MHW for a particular case has to be determined from the 
power spectrum of the data. This in a certain way mimics 
the determination of the optimal scale Ro that is automati- 
cally included in the optimal adaptive filter via eqs. (Q) and 
(^). The MHW at its optimal scales resembles the shape 
of the optimal adaptive filter in Fourier space (in the sense 
that its maximum is located near the maximum of the opti- 
mal adaptive filter) and thus the effectiveness of both filters 
should be similar. 

In order to further decrease the number of spurious de- 
tections, we could take advantage of the fact that, in many 
realistic cases, due to the sky coverage of the experiment and 
its scanning strategy, many positions of the sky can be mea- 
sured more than once (that is, at different epochs). For ex- 
ample, a source of Odeg latitude will be detected once when 
the center of the ring is located on longitude (f> — (f) source — R, 
being R the radius of the ring, and once again several months 
later, when the center of the ring is located on longitude 
4> = 4>source + R- When this occurs, it would be possible to 
almost duplicate the amount of information in some areas of 
the sky and therefore improve both the sensitivity and the 
reliability of the detection. However, such a refined detec- 
tion can not be done when the data do not cover the whole 
sky, and therefore is useless for the construction of an early 
catalogue of sources during the mission's flight. 



6 CONCLUSIONS 

We have simulated and analysed a sequence of time ordered 
data such as the one that the future Planck 30 GHz LFI28 
channel will produce after the first 6 months of flight. The 
data include all the main foregrounds as well as CMB fluc- 
tuations, point sources and instrumental noise. The result- 
ing TOD has been ring-averaged in order to remove pixel- 
scale noise and then filtered with an optimal adaptive filter 
that includes the spectral properties of the data in order 
to maximize the detection of sources of a particular shape 
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Table 2. Different tests for the detections at 30 GHz. Columns have the same meaning than 
in table |l]. Two different cases are tabulated: detections with the optimal adaptive filter when 
a 5 deg band around the Galactic plane is excluded from the analysis and detections with a 
33' MHW when a 5 deg band around the Galactic plane is excluded from the analysis. 



<j detected spurious mean offset m.r.a.e. < bias > min. flux compl. flux 
sources sources (') (%) (%) (Jy) (Jy) 



Optimal Adaptive filter, excluding the Galaxy 



2.5 


520 


1272 


14.06 


23.79 


2.67 


0.542 


13.722 


3.0 


379 


257 


13.51 


23.69 


0.58 


0.644 


13.722 


3.5 


280 


34 


12.67 


20.87 


-6.04 


0.760 


13.722 


4.0 


212 


8 


12.45 


20.05 


-8.97 


0.885 


13.722 


5.0 


142 


1 


12.16 


18.73 


-16.25 


1.066 


13.722 


5.5 


118 


1 


11.94 


18.97 


-17.22 


1.197 


13.722 






Mexican Hat Wavelet, excluding the Galaxy 






2.5 


454 


347 


12.76 


20.21 


2.79 


0.531 


13.72 


3.0 


342 


87 


12.54 


19.04 


0.42 


0.693 


13.72 


3.5 


256 


25 


12.49 


18.28 


-2.37 


0.806 


13.72 


4.0 


187 


14 


12.15 


15.71 


-7.65 


0.844 


13.72 


5.0 


129 


7 


12.18 


15.07 


-10.64 


0.957 


13.72 


5.5 


108 


7 


12.01 


14.60 


-12.37 


1.099 


13.72 



(Gaussian) and scale (the scale of the antenna). The opti- 
mal adaptive filter was designed to produce an unbiased, 
efficient estimator of the amplitude of the sources at their 
position and to give a maximum of detections at the char- 
acteristic scale of the sources. The detection of the sources 
was performed by thresholding the filtered TOD and look- 
ing for connected sets of peaks belonging to adjacent rings. 
At a 4ct detection level 224 sources over a flux of 0.88 Jy 
are detected with a mean relative error (in absolute value) 
of 20.98% and a systematic bias of —7.69%. The position of 
the sources in the sky is determined with errors inferior to 
the size of the antenna. The catalogue of detected sources 
is complete at fluxes > 4.337 Jy. The number of spurious 
detections is 22. 

The performance of the optimal adaptive filter has been 
compared with the performances under the same conditions 
of a Gaussian window and a Mexican Hat Wavelet (MHW) 
of width equal to the beam width. The number of sources 
detected with the optimal adaptive filter is comparable to 
the number of sources detected with the MHW and much 
higher than the number of sources detected with the Gaus- 
sian. However, the optimal adaptive filter finds a signifi- 
cantly lower number of spurious sources than the other two 
filters. This is due to the fact that the optimal adaptive filter 
removes better the large scale fluctuations (e.g. the Galaxy) 
than the other two filters. To test this hypothesis a 5 deg 
band around the Galactic Plane was removed and the anal- 
ysis was repeated with the optimal adaptive filter and the 
MHW. This test shows that most of the spurious detections 
that were found with the MHW were located on the Galac- 



plitudes with the Gaussian window suffer from very large 
errors. In most cases, the Gaussian detects only the Galaxy. 

The number of spurious detections can be reduced by 
means of further analysis after the filtering. For example, 
when the sky coverage of the scan is big enough some areas 
of the sky can be observed twice or more times, increasing 
the signal to noise ratio of the sources that lie in such areas. 
This point is out of the scope of the present work, in which 
we only present the filter as a first step in the data reduction. 

In conclusion, the optimal adaptive filter is an efficient, 
unbiased and reliable tool for the detection and extraction 
of punctual sources from TOD. A few hundred of sources 
over 1 Jy will be detected in the 30 GHz channels of the 
future Planck mission with < 10% of spurious detections. 
The catalogue will be complete at fluxes > 4 Jy. One possible 
application of this technique would be the elaboration of 
early catalogues of sources. A greater number of detections is 
expected at higher frequencies. The simulation and analysis 
of such frequencies will be performed in a future work. 
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the optimal adaptive niter are umtormly distributed m tne 
sky. 

The mean absolute error in the determination of ampli- 
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filter than in the MHW case. The estimation of the am- 
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Figure 5. Comparison of the performances of three different filters over the 30 GHz TOD. Open circles and solid line refer to optimal 
adaptive filter results. Boxes and dashed line correspond to Mexican Hat Wavelet results. Triangles and dashed line correspond to 
Gaussian filter results. Both the MHW and Gaussian filter have a width of 33'. The lower panel shows the number of detections above 
the different a thresholds. The upper panel shows the ratio between spurious and true detections for the same a thresholds. 
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Point Sources at 4 a 

Optimal Pseudo-filter 




Figure 6. Positions of the detected sources with the optimal adaptive filter at a 4<r detection level. The sky is projected in Galactic 
coordinates. The dashed line represents the Galactic Equator. The optimal adaptive filter is able to detect sources even in the highly 
contaminated region around the Galactic Plane. 
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